NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Inception: Efficiently Computable Misinformation Attacks on Markov Games

McMahan, Jeremy; Wu, Young; Chen, Yudong; Zhu, Jerry; Xie, Qiaomin (August 2024, Reinforcement Learning Journal)

We study security threats to Markov games due to information asymmetry and misinformation. We consider an attacker player who can spread misinformation about its reward function to influence the robust victim player's behavior. Given a fixed fake reward function, we derive the victim's policy under worst-case rationality and present polynomial-time algorithms to compute the attacker's optimal worst-case policy based on linear programming and backward induction. Then, we provide an efficient inception (""planting an idea in someone's mind"") attack algorithm to find the optimal fake reward function within a restricted set of reward functions with dominant strategies. Importantly, our methods exploit the universal assumption of rationality to compute attacks efficiently. Thus, our work exposes a security vulnerability arising from standard game assumptions under misinformation.
more » « less
Full Text Available
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value

Wu, Young; Mcmahan, Jeremy; Chen, Yiding; Chen, Yudong; Zhu, Jerry; Xie, Qiaomin (July 2024, Proceedings of Machine Learning Research)

We study the game modification problem, where a benevolent game designer or a malevolent adversary modifies the reward function of a zero-sum Markov game so that a target deterministic or stochastic policy profile becomes the unique Markov perfect Nash equilibrium and has a value within a target range, in a way that minimizes the modification cost. We characterize the set of policy profiles that can be installed as the unique equilibrium of a game and establish sufficient and necessary conditions for successful installation. We propose an efficient algorithm that solves a convex optimization problem with linear constraints and then performs random perturbation to obtain a modification plan with a near-optimal cost.
more » « less
Full Text Available
Policy Gradient Bayesian Robust Optimization for Imitation Learning

Javed, Zaynah; Brown, Daniel; Sharma, Satvik; Zhu, Jerry; Balakrishna, Ashwin; Petrik, Marek; Dragan, Anca; Goldberg, Ken (January 2021, 38th International Conference on Machine Learning)

The difficulty in specifying rewards for many real world problems has led to an increased focus on learning rewards from human feedback, such as demonstrations. However, there are often many different reward functions that explain the human feedback, leaving agents with uncertainty over what the true reward function is. While most policy optimization approaches handle this uncertainty by optimizing for expected performance, many applications demand risk-averse behavior. We derive a novel policy gradient-style robust optimization approach, PG-BROIL, that optimizes a soft-robust objective that balances expected performance and risk. To the best of our knowledge, PG-BROIL is the first policy optimization algorithm robust to a distribution of reward hypotheses which can scale to continuous MDPs. Results suggest that PG-BROIL can produce a family of behaviors ranging from risk-neutral to risk-averse and outperforms state-of-the-art imitation learning algorithms when learning from ambiguous demonstrations by hedging against uncertainty, rather than seeking to uniquely identify the demonstrator’s reward function.
more » « less
Full Text Available
Policy Poisoning in Batch Reinforcement Learning and Control

Ma, Yuzhe; Zhang, Xuezhou; Sun, Wen; Zhu, Jerry (January 2019, Advances in Neural Information Processing Systems 32 (NIPS 2019))

Full Text Available

Search for: All records